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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re application of: FORSSIVIANN, et al. 

App. No.: National Stage of PCT/EP98/08424 

Filed: 23 December 1998 

For: SERINE PROTEASE INHIBITORS 

PRELIMINARY AMENDMENT 

Assistant Commissioner of Patents 
Washington, D.C. 20231 

Sir: 

Prior to calculating the filing fee, please amend the captioned application as follows. 

IN THE CLAIMS 
Cancel claims 1-20 without prejudice or disclaimer. 
Add the following claims. 

21. A serine protease inhibitor, characterized by having a domain with four cysteines, 
and a sequence of 13 amino acids is present between the first and second 
cysteines, a sequence of 1 8 amino acids is present between the second and third 
cysteines, and a sequence of 2 amino acids is present between the third and fourth 
cysteines. 

22. The serine protease inhibitor according to claim 21, characterized in that the 
sequence of the domain between the first and second cysteines is selected from 
HEFQAFMKNGKLF, SEYRKSRKNGRLF, 

DDFKKGERDGDFI, SEFRDQVRNGTLI, 
SAFRPFVRNGRLG, SEYRHYVRNGRLP, 



KEYEKQVRNGRLF, DEFRRLLQNGKLF, 
SQYQNQAKNGILF, AEYREQMKNGRLS, or 

NEYRKLVRNGKLA, DEFRSQMKNGKLI. 

23. The serine protease inhibitor according to claim 21, characterized in that the 
sequence between the second and third cysteines is selected from 
PQDKKFFQSLDGIMFINK, TRENDPIQGPDGKMHGNT, 
TRENDPVLGPDGKTHGNK, TREHNPVRGPDGKMHGNK, 
TRESDPVRGPDGRMHGNK, TRENDPIEGLDGKIHGNT, 
TRENDPIRGPDGKMHGNL, TRENDPVRGPDGKTHGNK, 
TRENDPIQGPDGKVHGNT, TRESDPVRDADGKSYNNQ, or 
TRESDPVRGPDGKTHGNK. 

24. The serine protease inhibitor according to claim 21, characterized in that the 
sequence between the third and fourth cysteines of the domain is selected from 
AT, AL, AM, SM, or TM. 

25. The serine protease inhibitor according to claim 21, having one of the following 
formulas: 

R1-C-HEFQAFMKNGKLF-C-PQDKKFFQSLDGIMFINK-C-AT-C-R2 

R1-C-DDFKKGERDGDFI-C-PDYYEAVCGTDGKTYDNR-C-AL-C-R2 

R,-C-SAFRPFVRNGRLG-C-TRENDPVLGPDGKTHGNK-C-AM-C-R2 

R,-C-KEYEKQVRNGRLF-C-TRESDPVRGPDGRMHGNK-C-AL-C-R2 

R1-C-SQYQNQAKNGILF-C-TRENDPIRGPDGKMHGNL-C-SM-C-R2 

R,-C-NEYRKLVRNGKLA-C-TRENDPIQGPDGKVHGNT-C-SM-C-R2 
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RrC-SEYRKSRKNGRLF-C-TRENDPIQGPDGKMHGNT-C-SM-C-R2 

RrC-SEFRDQVRNGTLI-C-TREHNPVRGPDGKMHGNK-C-AM-C-R2 

RrC-SEYRHYVRNGRLP-C-TRENDPIEGLDGKIHGNT-C-SM-C-Ra 

R1-C-DEFRRLLQNGKLF-C-TRENDPVRGPDGKTHGNK-C-AM-C-R2 

R,-C-AEYREQMKNGRLS-C-TRESDPVRDADGKSYNNQ-C-TM-C-R2 

RTC-DEFRSQMKNGKLI-C-TRESDPVRGPDGKTHGNK-C-TM-C-R2, 

wherein R^ is NH2, an amino acid, or a peptide with up to 1000 amino acids, and 

R2 is COOH, CONH2, an amino acid, or a peptide with up to 1000 amino acids, 

26. The serine protease inhibitor according to claim 21, characterized by containing 
a disulfide bridge between the first and fourth cysteines and/or between the second 
and third cysteines; or 

a disulfide bridge between the first and a fifth cysteine and/or between the second 
and fourth cysteines and/or between the third and a sixth cysteine, 

27. The serine protease inhibitor according to claim 21, characterized by being a 
fragment of VAKTI-1 (SEQ. ID. NO, 1) or VAKTI-2 (SEQ. ID, NO, 2), 

28. The serine protease inhibitor according to claim 27, characterized by being 
HF 6479 (SEQ. ID, NO. 3) or HF 7665 {SEQ, ID. NO. 4). 

29. A nucleic acid coding for a serine protease inhibitor according to claim 21 , 

30. A medicament containing 

the serine protease inhibitor according to claim 21 , 

a nucleic acid coding for the serine protease inhibitor, or 
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the serine protease inhibitor and the nucleic acid coding for the serine protease 
inhibitor, 

together with pharmaceutical vehicles. 

31. The nnedicannent according to claim 30, containing from 0.01 to 1000 mg per kg 
of body weight of the serine protease inhibitor. 

32. Method of using the medicament according to claim 30, wherein the medicament 
is the serine protease inhibitor, for the treatment of acute or chronic cervix 
inflammations, inflammations of Bartholin's glands and other vaginal regions, 
tonsillitis, pharyngitis and laryngitis, acute or chronic inflammatory processes 
accompanied by excessive formation of mucus and the resulting acute emergency 
situations, postoperative bleeding due to hyperfibrinolysis, and for the prophylaxis 
of lung emphysema formation in deficiencies of a^-proteinase inhibitor. 

33. Method of using the medicament according to claim 30, wherein the medicament 
is the nucleic acid coding for the serine protease inhibitor, in gene therapy for the 
treatment and prophylaxis of acute or chronic cervix inflammations, inflammations 
of Bartholin's glands and other vaginal regions, tonsillitis, pharyngitis and laryngitis, 
acute or chronic inflammatory processes accompanied by excessive formation of 
mucus and the resulting acute emergency situations, postoperative bleeding due 
to hyperfibrinolysis, and lung emphysema formation in deficiencies of a^-proteinase 
inhibitor. 

34. Antibodies or antibody fragments against epitopes of the serine protease inhibitor 
according to claim 21 . 
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35. Poly- or oligonucleotides which will hybridize to regions of the cDNA or 
corresponding RNA under stringent conditions and optionally prevent the 
expression of coding regions of the genes coding for the serine protease inhibitor 
according to claim 21 . 

36. A diagnostic agent containing at least one of the antibodies or antibody fragments 
according to claim 34. 

37. A medicament containing the antibodies or antibody fragments according to claim 
34 in therapeutically effective amounts. 

38. Method of using the medicament according to claim 37 for the treatment of 
diseases involving too high an expression of a serine protease inhibitor, 
characterized by the antibodies or antibody fragments having a domain with four 
cysteines, and a sequence of 13 amino acids is present between the first and 
second cysteines, a sequence of 18 amino acids is present between the second 
and third cysteines, and a sequence of 2 amino acids is present between the third 
and fourth cysteines. 

39. DNA coding for the serine protease inhibitor according to claim 21 . 

40. The DNA according to claim 39 having the SEQ. ID. NO. 5 or SEQ. ID. NO. 6. 

REMARKS 

Claims 21-40 are presented for consideration.. 

Claims 21-40 correspond to canceled claims 1-20, respectively, revised to eliminate 
multiple dependencies and to, otherwise, more clearly define the instant invention. 
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Favorable action is requested. 

Respectfully submitted, 

William E. Player M 
Reg. No. 321,409 

JACOBSON, PRICE, HOLMAN & STERN, PLLC 

JACOBSON, PRICE, HOLMAN & STERN, PLLC 

400 Seventh Street, N.W. 

Washington, D.C. 20004 

Tel. No.: 202-638-6666 

Atty. Dkt. No. P65678US0 

WEP/id 

R:\HOME\emennillo\June\P65678-PRE 
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Serine Protease Inhibitors 



The present invention relates to serine protease inhibitors, cDNA coding for serine 
protease inhibitors, medicannents containing such Inhibitors or their coding nucleic 
acid, use of the compounds according to the invention for the preparation of 
nnedicaments for the treatnnent of various indications, antibodies or antibody 
fragments against epitopes of the compounds according to the invention, poly- or 
oligonucleotides which will hybridize to genes of the compounds according to the 
invention, a diagnostic agent for detecting the compounds according to the 
invention, and medicaments containing antibodies or poly- or oligonucleotides 
according to the invention. 

Proteolytic processes play an important physiological role in all organisms; a 
distinction has to be made between non-specific and specific proteolytic reactions. 
The former include, for example, the digestion of food in the digestive tract by 
endopeptidases, and the intracellular degradation of used endogenous substances 
and phagocytosed materials by lysosomal proteases. Specific proteolyses mostly 
serve for the conversion of a proenzyme to its active form, as in the conversion of 
trypsinogen to trypsin, and of chymotrypslnogen to chymotrypsin, and in the 
callicreln-klnin cascades and the blood clotting cascade. Depending on the struc- 
ture of the reactive site of the proteinases involved, they are classified into the 
classes of serine proteases (e.g., chymotrypsin, trypsin, elastase and cathepsin G), 
aspartate proteases (e.g., cathepsin D, cathepsin E and pepsin), cysteine prote- 
ases (e.g., cathepsin B, cathepsin H and cathepsin L), and the metailo-proteases 
(e.g., collagenase and thermolysin). 

In order to be able to correct' the proteolytic processes which often proceed in a 
cascade, the organisms is provided with a number of other proteins, the protease 
inhibitors (for a survey, see Laskowski and Kato, 1980, and Bode and Huber, 
1992). Thus, the liver-synthesized human plasma protease inhibitors ai- 
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antichymotrypsin and ai-proteinase inhibitors protect the lung tissue from non- 
specific attack by the proteinases cathepsin G and elastase from polymorphonu- 
clear lymphocytes. When the balance between proteases and their specific 
inhibitors is disturbed, pathological effects may arise. For example, an excess ratio 
of elastase to ai-proteinase Inhibitor increases the risk of formation of a lung 
emphysema by a factor of about 20 to 30 in patients with a genetically caused 
deficiency in this factor as compared to the normal population (Carrel and Owen, 
1980). With smokers, the formation of an emphysema is promoted by oxidation of 
the amino acid methionine which is present in the reactive site of the ai-proteinase 
inhibitor by oxidants contained in cigarette smoke (Miller and Kuschner, 1969; 
Ohisson et al., 1980). Also in the case of infection with Gram-negative bacteria, 
their endotoxins can cause disintegration of phagocytes and thus the secretion of 
lysosomal proteases, which may cause an uncontrolled damage to tissues and 
inflammations due to the increased consumption of protease inhibitors. For this 
reason, certain protease inhibitors have a high therapeutic potential (see, e.g., 
Fritz, 1980). 

It has been the object of the present invention to provide further inhibitors of 
serine proteases. In addition, the genes or cDNA coding for the inhibitors according 
to the invention should be provided. 

A specific feature of the serine protease inhibitors according to the invention is that 
the serine protease inhibitor has a domain with four cysteines, and a sequence of 0 
to 20 amino acids is present between the first and second cysteines, or the serine 
protease inhibitor has a domain with six cysteines, and a sequence of 7 to 20 
amino acids is present between the first and second cysteines. 

Preferably, a sequence of 13 amino acids is present between a first and a second 
cysteine, and/or a sequence of 18 amino acids is present between a second and a 
third cysteine, and/or a sequence of 2 amino acids is present between a third and 
a fourth cysteine. 

It is particularly preferred that the sequence between a first and a second cysteine 
be selected from 
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HEFQAFMKNGKLF, 
DDFKKGERDGDFI, 
SAFRPFVRNGRLG, 
KEYEKQVRNGRLF, 
SQYQNQAKNGILF, 
NEYRKLVRNGKLA, 



SEYRKSRKNGRLF, 

SEFRDQVRNGTLI, 

SEYRHYVRNGRLP, 
DEFRRLLQNGKLF, 

AEYREQMKNGRLS, or 

DEFRSQMKNGKLI 



and/or the sequence between a second and a third cysteine be selected from 



and/or the sequence between a third and a fourth cysteine be selected from 
AT, AL, AM, SM, or TM. 

It is particularly preferred that the serine protease inhibitor according to the 
invention correspond to one of the following formulas: 

R1-C-HEFQAFIVIKNGKLF-C-PQDKKFFQSLDGIMFINK-C-AT-C-R2 

R1-C-DDFKKGERDGDFI-C-PDYYEAVCGTDGKTYDNR-C-AL-C-R2 

R1-C-SAFRPFVRNGRLG-C-TRENDPVLGPDGKTHGNK-C-AM-C-R2 

R1-C-KEYEKQVRNGRLF-C-TRESDPVRGPDGRMHGNK-C-AL-C-R2 

R1-C-SQYQNQAKNGILF-C-TRENDPIRGPDGKMHGNL-C-SM-C-R2 

R1-C-NEYRKLVRNGKLA-C-TRENDPIQGPDGKVHGNT-C-SM-C-R2 

R1-C-SEYRKSRKNGRLF-C-TRENDPIQGPDGKMHGNT-C-SM-C-R2 

R1-C-SEFRDQVRNGTLI-C-TREHNPVRGPDGKMHGNK-C-AM-C-R2 

R1-C-SEYRHYVRNGRLP-C-TRENDPIEGLDGKIHGNT-C-SM-C-R2 

R1-C-DEFRRLLQNGKLF-C-TRENDPVRGPDGKTHGNK-C-AM-C-R2 

R1-C-AEYREQMKNGRLS-C-TRESDPVRDADGKSYNNQ-C-TM-C-R2 



PQDKKFFQSLDGIMFINK, 

TRENDPVLGPDGKTHGNK, 

TRESDPVRGPDGRMHGNK, 

TRENDPIRGPDGKMHGNL, 

TRENDPIQGPDGKVHGNT, 



TRESDPVRDADGKSYNNQ, or 

TRESDPVRGPDGKTHGNK 



TRENDPIQGPDGKMHGNT, 
TREHNPVRGPDGKMHGNK, 
TRENDPIEGLDGKIHGNT, 
TRENDPVRGPDGKTHGNK, 
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R1-C-DEFRSQMKNGKLI-C-TRESDPVRGPDGKTHGNK-C-TM-C-R2, 

wherein Ri is NH2/ an amino acid, or a peptide with up to 100 amino acids^ and R2 
is COOH; CONH2, an amino acid, or a peptide with up to 100 amino acids. 

It is further preferred that the serine protease inhibitor contains one or more 
disulfide bridges. It is particularly for it to contain a disulfide bridge between the 
first and fourth cysteines and/or between the second and third cysteines, or to 
contain a disulfide bridge between the first and fifth cysteines and/or between the 
second and fourth cysteines and/or between the third and sixth cysteines. 

Preferred representatives of the serine protease inhibitors according to the 
invention are the compounds HF 6479 and HP 7655, and fragments of proteins 
VAKTI-l and VAKTI-2 according to Figures 1 and 2. 

In addition to the amino acid sequence of the preferred compounds according to 
the invention, further information about the cDNA coding for the compounds 
according to the invention can also be seen from Figures 1 to 3. In particular, the 
corresponding motives and primer-hybridizing sites are indicated. 

Compound HF 3479 according to the invention has a mass of 6,479 Dalton, and 
that of HF 7665 is 7,665 Dalton; both have been purified from hemofiltrate. 

According to the invention, a cDNA coding for the compounds according to the 
invention, especially a cDNA having the nucleic acid sequence according to Figures 
1 to 2, is also claimed. 

The compounds according to the invention are useful as medicaments. In this 
case, they are administered together with pharmaceutically acceptable vehicles. 

The medicaments according to the invention containing the protease inhibitors 
according to the invention are preferably administered In amounts of from 1 to 100 
nng/kg of the patient's body weight. As the dosage form, all galenic formulations 
for peptide active substances may be used. The medicaments containing nucleic 



- 5 - 



acids according to the invention are preferably administered in amounts of from 
0.1 to 100 mg/kg of body weight of a corresponding patient. In this case, the 
galenic dosage forms which may be used are those which are suitable for the 
administration of nucleic acids without rendering the nucleic acids ineffective by 
metabolic influences before they have reached their site of action. For example, 
liposomes in which the nucleic acids are contained can be employed as a galenic 
dosage form. 

The compounds according to the invention can be used, in particular, for the 
treatment of acute or chronic cervix inflammations, inflammations of Bartholin's 
gland or other vaginal regions, tonsillitis, pharyngitis and laryngitis, acute or 
chronic inflammatory processes accompanied by excessive formation of mucus and 
the resulting acute emergency situations, postoperative bleedings due to hyperfi- 
brinolysis, and for the prophylaxis of lung emphysema formation in deficiencies of 
ai-proteinase inhibitor. 

The compounds according to the invention can be administered in deficiencies of 
serine protease inhibitors to correct endogenous defects. The nucleic acids may 
also be used in gene therapy, either directly or coupled to suitable vehicles. 
Suitable vectors include, in particular, attenuated adenoviruses into which the 
corresponding genes have been incorporated. 

The polypeptides according to the invention, especially VAKTI-I and VAKTT-II, can 
serve for the preparation of antibodies or antibody fragments. These are simply 
prepared by the immunization of appropriate mammals. By per se known opera- 
tions, the antibodies may also be humanized so that such antibodies can also be 
employed for therapeutic use. Antibodies or antibody fragments can then by 
employed for the regulation of diseases in which the protease inhibitors are 
expressed in a pathological way. Also, antisense nucleic acids complementary to 
the nucleic acids according to the invention may also be employed in therapeutical 
use in overexpressions of the protease inhibitor genes. 
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The compounds according to the invention can be easily prepared by per se known 
methods of peptide or nucleotide synthesis. Preparation of the compounds by 
genetic engineering is also possible. 

Those skilled in the art will recognize that fragments of the polypeptides according 
to the invention may also be used provided that they retain the inhibitory proper- 
ties of the serine protease inhibitors. Those skilled in the art know how to find such 
fragments. Thus, this may be accomplished, for example, by a selected enzymatic 
cleavage of the compounds according to the invention. Side-chain modified amino 
acids may also be employed. N- or C-terminally modified polypeptides may also be 
used. In particular, phosphorylated, glycosylated, methylated, acetylated or 
similarly modified polypeptides can be employed provided that they do not 
substantially affect the activity of the serine protease inhibitors. 

Derivatives of the nucleic acids according to the invention which have modified 
triplet structures in accordance with codon usage may also be used. In addition, 
nucleic acids according to the invention also include those which are more stable 
towards degradation by nucleases as compared with the native compounds, for 
example, the corresponding SODN derivatives usually employed in antisense 
technology to give the antisense structures a more stable design towards enzy- 
matic attack. 

Structures homologous to the polypeptides may also be used. In particular, these 
include polypeptide structures in which amino acids have been exchanged. Thus, 
for example, conservative amino acid substitutions in highly conserved regions can 
be considered as follows: any isoleucine, valine and leucine amino acid can be 
exchanged for any other of these amino acids, aspartate can be exchanged for 
glutamate and vice versa, glutamine for asparagine and vice versa, serine for 
threonine and vice versa. Conservative amino acid substitutions in less highly 
conserved regions can be as follows: Any of the amino acids isoleucine, valine and 
leucine for any other of these amino acids, aspartate for glutamate and vice versa, 
glutamine for asparagine and vice versa, serine for threonine and vice versa, 
glycine for alanine and vice versa, alanine for valine and vice versa, any of the 
amino acids leucine, isoleucine or valine for methionine, lysine for arginine and 



- 7 - 



vice versa, either of the amino acids arginine or lysine for either of the annino acids 
aspartate or glutamate, either of the amino acids arginine or lysine for histidine, 
glutamine for glutamate and vice versa, and asparagine for aspartate and vice 
versa. 

The mode of action of the peptides according to the invention will be illustrated by 
the following Example. 

Example 

Measurement of protease inhibition by HF 7665 
Measuring composition: 

84 Ml measuring buffer (0.1 M HEPES, pH 7.5; 0.5 M NaCI) 
1 pi trypsin (1 mg/ml in 1 mM HCI, 20 mM CaClz) 

5 pi L-BABNA (6 mg/ml N-a-benzoyl-L-arginine-p-nitroanilide hydrochloride) 
10 pi protease inhibitor (10 pM or 75 pg/mi HF 7665 in H2O) 

The reaction was started by adding the chromogenic substrate, and the substrate 
conversion was followed by a photometer at X = 405 nm. After about fi^e minutes, 
10 pi of protease inhibitor or the corresponding controls were added and the 
further course of the absorbance observed. 

It could be shown that HF 7665 has an inhibitory effect on trypsin in a final 
concentration of about 1 pM or 7.5 pg/ml. Control experiments with corresponding 
amounts of BSA (7.5 pg/ml) and acetonitrile/TFA (0.8% ACN/0.001% TFA) did not 
show any trypsin inhibition. Further, an inhibitory effect of HF 7665 on chymotryp- 
sin could not be observed in a similar test. 

Figure 3 shows that the substrate conversion is reduced by about 30% due to 
trypsin inhibition after the addition of HF 7565. 
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(iii) NUMBER OF SEQUENCES: 34 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disl< 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Met Lys lie Ala Thr Val Ser Val Leu Leu Pro Leu Ala Leu Cys Leu 
15 10 15 
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He Gin Asp Ala Ala Ser Lys Asn Glu Asp Gin Glu Met Cys His Glu 
20 25 30 

Phe Gin Ala Phe Met Lys Asn Gly Lys Leu Phe Cys Pro Gin Asp Lys 
35 40 45 

Lys Phe Phe Gin Ser Leu Asp Gly He Met Phe He Asn Lys Cys Ala 
50 55 60 

Thr Cys Lys Met He Leu Glu Lys Glu Ala Lys Ser Gin Lys Arg Ala 
65 70 75 80 

Arg His Leu Ala Arg Ala Pro Lys Ala Thr Ala Pro Thr Glu Leu Asn 
85 90 95 

Cys Asp Asp Phe Lys Lys Gly Glu Arg Asp Gly Asp Phe He Cys Pro 
100 105 110 

Asp Tyr Tyr Glu Ala Val Cys Gly Thr Asp Gly Lys Thr Tyr Asp Asn 
115 120 125 

Arg Cys Ala Leu Cys Ala Glu Asn Ala Lys Thr Gly Ser Gin He Gly 
130 135 140 

Val Lys Ser Glu Gly Glu Cys Lys Ser Ser Asn Pro Glu Gin Val Arg 
145 150 155 160 



Ser He Val Ser Leu Met Gly Asn Thr Gly Arg Leu Thr Ser Asn Ser 
165 170 175 



Lys 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 922 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys He Ala Thr Val Ser Val Leu Leu Pro Leu Ala Leu Cys Leu 
15 10 15 



He Gin Asp Ala Ala Ser Lys Asn Glu Asp Gin Glu Met Cys His Glu 
20 25 30 
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Phe Gin Ala Phe Met Lys Asn Giy Lys Leu Phe Cys Pro Gin Asp Lys 
35 40 45 

Lys Phe Phe Gin Ser Leu Asp Giy lie Met Phe lie Asn Lys Cys Ala 
50 55 60 

Thr Cys Lys Met lie Leu Glu Lys Glu Ala Lys Ser Gin Lys Arg Ala 
65 70 75 80 

Arg His Leu Ala Arg Ala Pro Lys Ala Thr Ala Pro Thr Glu Leu Asn 
85 90 95 

Cys Asp Asp Phe Lys Lys Giy Glu Arg Asp Giy Asp Phe lie Cys Pro 
100 105 110 

Asp Tyr Tyr Glu Ala Val Cys Giy Thr Asp Giy Lys Thr Tyr Asp Asn 
115 120 125 

Arg Cys Ala Leu Cys Ala Glu Asn Ala Lys Thr Giy Ser Gin lie Giy 
130 135 140 

Val Lys Ser Glu Giy Glu Cys Lys Ser Ser Asn Pro Glu Gin Asp Val 
145 150 155 160 

Cys Ser Ala Phe Arg Pro Phe Val Arg Asn Giy Arg Leu Giy Cys Thr 
165 170 175 

Arg Glu Asn Asp Pro Val Leu Giy Pro Asp Giy Lys Thr His Giy Asn 
180 185 190 

Lys Cys Ala Met Cys Ala Glu Leu Phe Leu Lys Glu Ala Glu Asn Ala 
195 200 205 

Lys Arg Glu Giy Glu Thr Arg lie Arg Arg Asn Ala Glu Lys Asp Phe 
210 215 220 

Cys Lys Glu Tyr Glu Lys Gin Val Arg Asn Giy Arg Leu Phe Cys Thr 
225 230 235 240 

Arg Glu Ser Asp Pro Val Arg Giy Pro Asp Giy Arg Met His Giy Asn 
245 250 255 

Lys Cys Ala Leu Cys Ala Glu lie Phe Lys Arg Arg Phe Ser Glu Glu 
260 265 270 

Asn Ser Lys Thr Asp Gin Asn Leu Giy Lys Ala Glu Glu Lys Thr Lys 
275 280 285 

Val Lys Arg Glu lie Val Lys Leu Cys Ser Gin Tyr Gin Asn Gin Ala 
290 295 300 

Lys Asn Giy lie Leu Phe Cys Thr Arg Glu Asn Asp Pro lie Arg Giy 
305 310 315 320 

Pro Asp Giy Lys Met His Giy Asn Leu Cys Ser Met Cys Gin Val Tyr 
325 330 335 

Phe Gin Ala Glu Asn Glu Glu Lys Lys Lys Ala Glu Ala Arg Ala Arg 
340 345 350 



- 11 - 



Asn Lys Arg Glu Ser Giy Lys Ala Thr Ser Tyr Ala Giu Leu Cys Asn 
355 360 365 

Glu Tyr Arg Lys Leu Val Arg Asn Gly Lys Leu Ala Cys Thr Arg Glu 
370 375 380 

Asn Asp Pro lie Gin Gly Pro Asp Gly Lys Val His Gly Asn Thr Cys 
385 390 395 400 

Ser Met Cys Glu Val Phe Phe Gin Ala Glu Glu Glu Glu Lys Lys Lys 
405 410 415 

Lys Glu Gly Glu Ser Arg Asn Lys Arg Gin Ser Lys Ser Thr Ala Ser 
420 425 430 

Phe Glu Glu Leu Cys Ser Glu Tyr Arg Lys Ser Arg Lys Asn Gly Arg 
435 440 445 

Leu Phe Cys Thr Arg Glu Asn Asp Pro lie Gin Gly Pro Asp Gly Lys 
450 455 460 

Met His Gly Asn Thr Cys Ser Met Cys Glu Ala Phe Phe Gin Gin Glu 
465 470 475 480 

Glu Arg Ala Arg Ala Lys Ala Lys Arg Glu Ala Ala Lys Glu lie Cys 
485 490 495 

Ser Glu Phe Arg Asp Gin Val Arg Asn Gly Thr Leu lie Cys Thr Arg 
500 505 510 

Glu His Asn Pro Val Arg Gly Pro Asp Gly Lys Met His Gly Asn Lys 
515 520 525 

Cys Ala Met Cys Ala Ser Val Phe Lys Leu Glu Glu Glu Glu Lys Xys 
530 535 540 

Asn Asp Lys Glu Glu Lys Gly Lys Val Glu Ala Glu Lys Val Lys Arg 
545 550 555 560 

Glu Ala Val Gin Glu Leu Cys Ser Glu Tyr Arg His Tyr Val Arg Asn 
565 570 575 

Gly Arg Leu Pro Cys Thr Arg Glu Asn Asp Pro He Glu Gly Leu Asp 
580 585 590 

Gly Lys He His Gly Asn Thr Cys Ser Met Cys Glu Ala Phe Phe Gin 
595 600 605 

Gin Glu Ala Lys Glu Lys Glu Arg Ala Glu Pro Arg Ala Lys Val Lys 
610 615 620 

Arg Glu Ala Glu Lys Glu Thr Cys Asp Glu Phe Arg Arg Leu Leu Gin 
625 630 635 640 



Asn Gly Lys Leu Phe Cys Thr Arg Glu Asn Asp Pro Val Arg Gly Pro 
645 650 655 
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Asp Gly Lys Thr His Gly Asn Lys Cys Ala Met Cys Lys Ala Val Phe 
660 665 670 

Gin Lys Glu Asn Glu Glu Arg Lys Arg Lys Giu Glu Glu Asp Gin Arg 
675 680 685 

Asn Ala Ala Gly His Gly Ser Ser Gly Gly Gly Gly Gly Asn Thr Gin 
690 695 700 

Asp Glu Cys Ala Glu Tyr Arg Glu Gin Met Lys Asn Gly Arg Leu Ser 
705 710 715 720 

Cys Thr Arg Glu Ser Asp Pro Val Arg Asp Ala Asp Gly Lys Ser Tyr 
725 730 735 

Asn Asn Gin Cys Thr Met Cys Lys Ala Lys Leu Glu Arg Glu Ala Glu 
740 745 750 

Arg Lys Asn Glu Tyr Ser Arg Ser Arg Ser Asn Gly Thr Gly Ser Glu 
755 760 765 

Ser Gly Lys Asp Thr Cys Asp Glu Phe Arg Ser Gin Met Lys Asn Gly 
770 775 780 

Lys Leu lie Cys Thr Arg Glu Ser Asp Pro Val Arg Gly Pro Asp Gly 
785 790 795 800 

Lys Thr His Gly Asn Lys Cys Thr Met Cys Lys Giu Lys Leu Glu Arg 
805 810 815 

Glu Ala Ala Glu Lys Lys Arg Lys Arg Met Lys Thr Gly Ala lie Gin 
820 825 830 

Glu Lys Gly Ala He Gin Glu Lys Gly Ala Met Thr Lys Arg He Cys 
835 840 845 

Val Val Asn Phe Glu Ala Cys Arg Glu Met Glu Ser Leu Ser Ala Pro 
850 855 860 

Glu Lys He Thr Leu Phe Glu Ala His Met Ala Arg Cys Thr Ser He 
865 870 875 880 

Asn Val Leu Cys Val Arg Ala Ser Leu He Glu Lys Leu Met Lys Glu 
885 890 895 

Lys Arg Lys Met Lys Arg Asn Gin Val Ala Ser Pro Gin He Met Gin 
900 905 910 



Arg Met Ser Ala Val Asn Phe Glu Thr He 
915 920 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Lys Asn Glu Asp Gin Glu Met Cys His Glu Phe Gin Ala Phe Met Lys 
15 10 15 

Asn Gly Lys Leu Phe Cys Pro Gin Asp Lys Lys Phe Phe Gin Ser Leu 
20 25 30 

Asp Gly lie Met Phe lie Asn Lys Cys Ala Thr Cys Lys Met He Leu 
35 40 45 

Glu Lys Glu Ala Lys Ser Gin 
50 55 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino adds 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Glu Ser Gly Lys Ala Thr Ser Tyr Ala Glu Leu Cys Asn Glu Tyr Arg 
15 10 15 

Lys Leu Val Arg Asn Gly Lys Leu Ala Cys Thr Arg Glu Asn Asp Pro 
20 25 30 



lie Gin Gly Pro Asp Gly Lys Val His Gly Asn Thr Cys Ser Met Cys 
35 40 45 
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Glu Val Phe Phe Gin Ala Glu Glu Glu Glu Lys Lys Lys Lys Glu Gly 
50 55 60 

Glu Ser Arg Asn 
65 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATGCATGGAG TGGACCTGTA GGCGACTTGC ATCGTCTTCA ACATGAAGAX AGCCACAGTG 60 

TCAGTGCTTC TGCCCTTGGC TCTTTGCCTC ATACAAGATG CTGCCAGTAA GAATGAAGAT 120 

CAGGAAATGT GCCATGAATT TCAGGCATTT ATGAAAAATG GAAAACTGTT CTGTCCCCAG 180 

GATAAGAAAT TTTTTCAAAG TCTTGATGGA ATAATGTTCA TCAATAAATG TGCCACGTGC 24 0 

AAAATGATAC TGGAAAAAGA AGCAAAATCA CAGAAGAGGG CCAGGCATTT AGCAAGAGCT' 300 

CCCAAGGCTA CTGCCCCAAC AGAGCTGAAT TGTGATGATT TTAAAAAAGG AGAAAGAGAT 3 60 

GGGGATTTTA TCTGTCCTGA TTATTATGAA GCTGTTTGTG GCACAGATGG GAAAACATAT 4 20 

GACAACAGAT GTGCACTGTG TGCTGAGAAT GCGAAAACCG GGTCCCATVAT TGGTGTAAAA 480 

AGTGAAGGGG AATGTAAGAG CAGTAATCCA GAGCAGGTGA GGTCAATTGT CAGCCTGATG 54 0 

GGAAATACTG GGAGGCTAAC TTCAAATAGT AAGTAGGTGC TGTCCTCTTC CTTCTTAGGT 600 

GGGAGCCTTG GAAGGAATTA ATTCTTGCTT TATGTGAAAT GGAATACCCA GTTACTGCCC 660 

ACTAATATGA ATU^lAGCTAAT TATAGTCTCT GAAACTGGAT CAGATTACTT TGGTGGTTAA 720 

GATCTTTCAA TCTATTGCTG CTTTGTAT 7 48 



- 15 - 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3531 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(it) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ATGCATGGAG TGGACCTGTA GGCGACTTGC ATCGTCTTCA ACATGAAGAT AGCCACAGTG 60 

TCAGTGCTTC TGCCCTTGGC TCTTTGCCTC ATACAAGATG CTGCCAGTAA GAATGAAGAT 120 

CAGGAAATGT GCCATGAATT TCAGGCATTT ATGAAAAATG GAT^CTGTT CTGTCCCCAG 180 

GATAAGAAAT TTTTTCAAAG TCTTGATGGA ATAATGTTCA TCAATAAATG TGCCACGTGC 24 0 

AAAATGATAC TGGAAAT^GA AGCAAAATCA CAGAAGAGGG CCAGGCATTT AGCAAGAGCT 300 

CCCAAGGCTA CTGCCCCAAC AGAGCTGAAT TGTGATGATT TTAAAAAAGG AGAAAGAGAT 360 

GGGGATTTTA TCTGTCCTGA TTATTATGAA GCTGTTTGTG GCACAGATGG GAAAACATAT 420 

GACAACAGAT GTGCACTGTG TGCTGAGAAT GCGAAAACCG GGTCCCAAAT TGGTGTAAAA 480 

AGTGAAGGGG AATGTi\AGAG CAGTAATCCA GAGCAGGATG TATGCAGTGC TTTTCGGCCC 540 

TTTGTTAGAA ATGGAAGACT TGGATGCACA AGGGAAAATG ATCCTGTTCT TGGTCCTGAT 600 

GGGAAGACGC ATGGCAATAA GTGTGCAATG TGTGCTGAGC TGTTTTTAAA AGAAGCTGAA 660 

AATGCCAAGC GAGAGGGTGA AACTAGAATT CGACGAAATG CTGAAAAGGA TTTTTGCAAG 720 

GAATATGAAA AACAAGTGAG AAATGGAAGG CTTTTTTGTA CACGGGAGAG TGATCCAGTC 780 

CGTGGCCCTG ACGGCAGGAT GCATGGCAAC AAATGTGCCC TGTGTGCTGA AATTTTCAAG 84 0 

CGGCGTTTTT CAGAGGAAAA CAGTAAAACA GATCAAAATT TGGGAAAAGC TGAAGAAAAA 900 

ACTAAAGTTA AAAGAGAAAT TGTGAAACTC TGCAGTCAAT ATCAAAATCA GGCAAAGAAT 960 

GGAATACTTT TCTGTACCAG AGAAAATGAC CCTATTCGTG GTCCAGATGG GAAAATGCAT 1020 

GGCAACTTGT GTTCCATGTG TCAAGTCTAC TTCCAAGCAG AAAATGAAGA AAAGAAAAAG 1080 

GCTGAAGCAC GAGCTAGAAA CAAAAGAGAA TCTGGAAAAG CAACCTCATA TGCAGAGCTT 1140 

TGCAATGAAT ATCGAAAGCT TGTGAGGAAC GGAAAACTTG CTTGCACCAG AGAGAACGAT 1200 
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CCTATTCAGG GCCCAGATGG GAAAGTGCAC GGCAACACCT GCTCCATGTG TGAGGTTTTT 1260 

TTCCAAGCAG AAGAAGAAGA AAAGAAAAAG AAGGAAGGCG AATCAAGAAA CAAAAGACAA 1320 

TCTAAGAGTA CAGCTTCCTT TGAGGAGTTG TGTAGTGAAT ACCGCAAATC CAGGAAAAAC 1380 

GGACGGCTTT TTTGCACCAG AGAGAATGAC CCCATCCAGG GCCCAGATGG GAAAATGCAT 1440 

GGCAACACCT GCTCCATGTG TGAGGCCTTC TTTCAACAAG AAGAAAGAGC AAGAGCAAAG 1500 

GCTAAAAGAG AAGCTGCAAA GGAAATCTGC AGTGAATTTC GGGACCAAGT GAGGAATGGA 1560 

ACACTTATAT GCACCAGGGA GCATAATCCT GTCCGTGGAC CAGATGGCAA AATGCATGGA 1620 

AACAAGTGTG CCATGTGTGC CAGTGTGTTC AAACTTGAAG AAGAAGAGAA GAAAAATGAT 1680 

AAAGAAGAAA AAGGGAAAGT TGAGGCTGAA AAAGTTAAGA GAGAAGCAGT TCAGGAGCTG 17 40 

TGCAGTGAAT ATCGTCATTA TGTGAGGAAT GGACGACTCC CCTGTACCAG AGAGAATGAT 1800 

CCTATTGAGG GTCTAGATGG GAAAATCCAC GGCAACACCT GCTCCATGTG TGAAGCCTTC 1860 

TTCCAGCAAG AAGCAAAAGA AAAAGT^GA GCTGAACCCA GAGCAAAAGT CAAAAGAGAA 1920 

GCTGAAAAGG AGACATGCGA TGAATTTCGG AGACTTTTGC AAAATGGAAA ACTTTTCTGC 1980 

ACAAGAGAAA ATGATCCTGT GCGTGGCCCA GATGGCAAGA CCCATGGCAA CAAGTGTGCC 2040 

ATGTGTAAGG CAGTCTTCCA GAAAGAAAAT GAGGAAAGAA AGAGGAAAGA AGAGGAAGAT 2100 

CAGAGAAATG CTGCAGGACA TGGTTCCAGT GGTGGTGGAG GAGGAAACAC TCAGGACGAA 2160 

TGTGCTGAGT ATCGGGAACA AATGAAAAAT GGAAGACTCA GCTGTACTCG GGAGAGTGAT 2220 

CCTGTACGTG ATGCTGATGG CAAATCGTAC AACAATCAGT GTACCATGTG TAAAGCAAAA, 2280 

TTGGAAAGAG AAGCAGAGAG AAAAAATGAG TATTCTCGCT CCAGATCAAA TGGGACTGGA 2340 

TCAGAATCAG GGAAGGATAC ATGTGATGAG TTTAGAAGCC AAATGAAAAA TGGAAAACTT 2400 

ATCTGCACTC GAGAAAGTGA CCCTGTCCGG GGTCCAGATG GCAAGACACA TGGTAATAAG 24 60 

TGTACTATGT GTAAGGAAAA ACTGGAAAGG GAAGCAGCTG AAAAAAAAAG AAAGAGGATG 2520 

AAGACAGGAG CAATACAGGA GAAAGGAGCA ATACAGGAGA AAGGAGCAAT GACAAAGAGG 2580 

ATCTGTGTCG TGAATTTCGA AGCATGCAGA GAAATGGAAA GCTTATCTGC ACCAGAGAAA 2640 

ATAACCCTGT TCGAGGCCCA TATGGCAAGA TGCACATCAA TAAATGTGCT ATGTGTCAGA 27 00 

GCATCTTTGA TCGAGAAGCT AATGAAAGAA AAAAGAAAGA TGAAGAGAAA TCAAGTAGCA 27 60 

AGCCCTCAAA TT^TGCAAAG GATGAGTGCA GTGAATTTCG AAACTATATA AGGAACAATG 2820 

AACTCATCTG CCCTAGAGAG AATGACCCAG TGCACGGTGC TGATGGAAAG TTCTATACAA 2880 

ACAAGTGCTA CATGTGCAGA GCTGTCTTTC TAACAGAAGC TTTGGAAAGG GCAAAGCTTC 2 940 

AAGAAAAACC ATCCCATGTT AGAGCTTCTC AAGAGGAAGA CAGCCCAGAC TCTTTCAGTT 3000 
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CTCTGGATTC TGAGATGTGC AAAGACTACC GAGTATTGCC CAGGATAGGC TATCTTTGTC 3060 

CAAAGGATTT AAAGCCTGTC TGTGGTGACG ATGGCCAAAC CTACAACAAT CCTTGCATGC 3120 

TCTGTCATGA AAACCTGATA CGCCAAACAA ATACACACAT CCGCAGTACA GGGAAGTGTG 3180 

AGGAGAGCAG CACCCCAGGA ACCACCGCAG CCAGCATGCC CCCGTTTGAC GAATGACAGG 3240 

AAGATTGTTG AAAGCCATGA GGGAAAAAAT AAACCCCAGT TTTGAATCAC CTACCTTCAC 3300 

CATCTGTATA TACAAAGAAT TTTTCGGAGC TTGTTTTATT TGCTATAGAA AACAATACAG 3360 

AGCTTTTGGG AATGGAATCA CTGATTTTCA GTCTTTTCCA TTTCTTTCCT CCTAGAATCT 3420 

GTGATCTGAG GGTATAAAGA CATTTCCACC AAGTTTGAGC CCTCAAAATG TCCTGATTAC 34 80 

AATGCTGTCT GTCCAACTGC CTGTTCAATA AAAGTAAACT CAGCAGAAAA A 3531 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

His Glu Phe Gin Ala Phe Met Lys Asn Gly Lys Leu Phe 
15 10 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unlcnown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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Ser Glu Tyr Arg Lys Ser Arg Lys Asn Gly Arg Leu Phe 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13 amino acids 
CB) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unl<nown 



(ii) I^OLECULETYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Asp Asp Phe Lys Lys Gly Glu Arg Asp Gly Asp Phe He 
1 5 10 

(2) INFORIVIATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unlcnown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ser Glu Phe Arg Asp Gin Val Arg Asn Gly Thr Leu He 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Ser Ala Phe Arg Pro Phe Val Arg Asn Gly Arg Leu Gly 
15 10 



(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ser Glu Tyr Arg His Tyr Val Arg Asn Gly Arg Leu Pro 
^5 10 

(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Lys Glu Tyr Glu Lys Gin Val Arg Asn Gly Arg Leu Phe 
^5 10 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(li) I^IOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Asp Glu Phe Arg Arg Leu Leu Gin Asn Gly Lys Leu Phe 
15 10 

(2) INFORI^ATION FOR SEQ ID NO: 15: 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ser Gin Tyr Gin Asn Gin Ala Lys Asn Gly lie Leu Phe 
15 10 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino adds 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Ala Glu Tyr Arg Glu Gin Met Lys Asn Gly Arg Leu Ser 
15 10 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Asn Glu Tyr Arg Lys Leu Val Arg Asn Gly Lys Leu Ala 
15 10 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



Asp Glu Phe Arg Ser Gin Met Lys Asn Gly Lys Leu lie 
15 10 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Pro Gin Asp Lys Lys Phe Phe Gin Ser Leu Asp Gly lie Met Phe lie 
15 10 15 

Asn Lys 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Thr Arg Glu Asn Asp Pro lie Gin Gly Pro Asp Gly Lys Met His Gly 
15 10 15 

Asn Thr 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Thr Arg Glu Asn Asp Pro Val Leu Gly Pro Asp Gly Lys Thr His Gly 
15 10 15 

Asn Lys 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Thr Arg Glu His Asn Pro Val Arg Gly Pro Asp Gly Lys Met His Gly 
15 10 15 

Asn Lys 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Thr Arg Glu Ser Asp Pro Val Arg Gly Pro Asp Gly Arg ^flet His Gly 
15 10 15 

Asn Lys 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Thr Arg Glu Asn Asp Pro lie Glu Gly Leu Asp Gly Lys lie His Gly 
15 10 15 

Asn Thr 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Thr Arg Glu Asn Asp Pro lie Arg Gly Pro Asp Gly Lys Met His Gly 
15 10 15 

Asn Leu 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; unknown 

(ii) IVIOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Thr Arg Glu Asn Asp Pro Val Arg Gly Pro Asp Gly Lys Thr His Gly 
15 10 15 

Asn Lys 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unl<nown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Thr Arg Glu Asn Asp Pro lie Gin Gly Pro Asp Gly Lys Val His Gly 
15 10 15 

Asn Thr 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Thr Arg Glu Ser Asp Pro Val Arg Asp Ala Asp Gly Lys Ser Tyr Asn 
15 10 15 

Asn Gin 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Thr Arg Glu Ser Asp Pro Val Arg Gly Pro Asp Gly Lys Thr His Gly 
15 10 15 

Asn Lys 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Ala Thr 
1 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unl<nown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Ala Leu 
1 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unl<nown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Ala Met 
1 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHAJ^CTERISTICS: 

(A) LENGTH: 2 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Ser Met 
1 

(2) INFORI^IATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(11) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Thr Met 
1 
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CLAIMS: 



1. A serine protease inhibitor, characterized by having a domain with 
four cysteines, and a sequence of 13 amino acids is present between 
the first and second cysteines, a sequence of 18 amino acids is pre- 
sent between the second and third cysteines, and a sequence of 2 
amino acids is present between the third and fourth cysteines. 

2. The serine protease Inhibitor according to claim 1, characterized In 
that the sequence of the domain between the first and second cys- 
teines is selected from 



3. The serine protease inhibitor according to any of claims 1 and/or 2, 
characterized In that the sequence between the second and third cys- 
teines is selected from 



HEFQAFI^KNGKLF, 
DDFKKGERDGDFI, 
SAFRPFVRNGRLG, 
KEYEKQVRNGRLF, 
SQYQNQAKNGILF, 
NEYRKLVRNGKLA, 



AEYREQI^^KIMGRLS, or 
DEFRSQIVIKNGKLI. 



SEYRKSRKNGRLF, 
SEFRDQVRNGTLI, 
SEYRHYVRNGRLP, 
DEFRRLLQNGKLF, 



PQDKKFFQSLDGIMRNK, 

TRENDPVLGPDGKTHGNK, 

TRESDPVRGPDGRMHGNK, 

TRENDPIRGPDGKMHGNL, 

TRENDPIQGPDGKVHGNT, 

TRESDPVRGPDGKTHGNK. 



TRESDPVRDADGKSYNNQ, or 



TRENDPIQGPDGKMHGNT, 
TREHNPVRGPDGKMHGNK, 
TRENDPIEGLDGKIHGNT, 
TRENDPVRGPDGKTHGNK, 
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4. The serine protease inhibitor according to any of claims 1 to 3, 
characterized in that the sequence between the third and fourth cys- 
teines of the domain is selected from 

AT, AL, AM, SI^, or TIM, 

5. The serine protease Inhibitor according to any of claims 1 to 4, having 
one of the following formulas: 

R1-C-HEFQAFMKNGKLF-C-PQDKKFFQSLDGIMFINK-C-AT-C-R2 

R1-C-DDFKKGERDGDFI-C-PDYYEAVCGTDGKTYDNR-C-AL-C-R2 

R1-C-SAFRPFVRNGRLG-C-TRENDPVLGPDGKTHGNK-C-AM-C-R2 

R1-C-KEYEKQVRNGRLF-C-TRESDPVRGPDGRMHGNK-C-AL-C-R2 

R1-C-SQYQNQAKNGILF-C-TRENDPIRGPDGKMHGNL-C-SM-C-R2 

R1-C-NEYRKLVRNGKLA-C-TRENDPIQGPDGKVHGNT-C-SM-C-R2 

R1-C-SEYRKSRKNGRLF-C-TRENDPIQGPDGKMHGNT-C-SM-C-R2 

R1-C-SEFRDQVRNGTLI-C-TREHNPVRGPDGKMHGNK-C-AM-C-R2 

R1-C-SEYRHYVRNGRLP-C-TRENDPIEGLDGKIHGNT-C-SM-C-R2 

Ri-C-DEFRRLLQNGKLF-C-TRENDPVRGPDGKTHGNK-C-AM-CrR2 

R1-C-AEYREQMKNGRLS-C-TRESDPVRDADGKSYNNQ-C-TM-C-R2 

Ri-C-DEFRSQMKNGKLI-C-TRESDPVRGPDGKTHGNK-C-TM-C-R2, 

wherein Ri is NH2, an amino acid, or a peptide with up to 1000 amino 
acids, and R2 is COOH, CONH2, an amino acid, or a peptide with up to 
1000 amino acids. 

6. The serine protease inhibitor according to at least one of claims 1 to 
5, characterized by containing 

a disulfide bridge between the first and fourth cysteines and/or 
between the second and third cysteines; or 
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a disulfide bridge between the first and fifth cysteines and/or 
between the second and fourth cysteines and/or between the 
third and sixth cysteines. 

7. The serine protease inhibitor according to at least one of claims 1 to 
6, characterized by being a fragment of VAKTI-l (SEQ. ID. NO, 1) or 
VAKTI-2 (SEQ. ID. NO. 2), 

8. The serine protease inhibitor according to claim 7, characterized by 
being HF 6479 (SEQ. ID. NO. 3) or HF 7665 (SEQ. ID. NO. 4). 

9. A nucleic acid coding for a serine protease inhibitor according to at 
least one of claims 1 to 8. 

10. A medicament containing at least one serine protease inhibitor 
according to at least one of claims 1 to 8 and/or a nucleic acid 
according to claim 9, optionally together with pharmaceutical 
vehicles. 

11. The medicament according to claim 10, containing from 0.01 to 1000 
mg per kg of body weight of the serine protease Inhibitor according to 
at least one of claims 1 to 8 and/or of the nucleic acid according to 
claim 9. 

12. Use of the serine protease inhibitor according to at least one of claims 
1 to 8 for preparing a medicament for the treatment of acute or 
chronic cervix inflammations, inflammations of Bartholin's glands and 
other vaginal regions, tonsillitis, pharyngitis and laryngitis, acute or 
chronic inflammatory processes accompanied by excessive formation 
of mucus and the resulting acute emergency situations, postoperative 
bleedings due to hyperfibrinolysis, and for the prophylaxis of lung 
emphysema formation in deficiencies of ai-proteinase inhibitor. 
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13. Use of the nucleic acids according to claim 9 for preparing a medica- 
ment for use in gene therapy for the curing and prophylaxis of dis- 
eases as mentioned in claim 12. 

14. Antibodies or antibody fragments against epitopes of the compounds 
according to any of claims 1 to 8. 

15. Poly- or oligonucleotides which will hybridize to regions of the cDNA 
or corresponding RNA under stringent conditions and optionally pre- 
vent the expression of coding regions of the genes coding for the 
compounds according to claims 1 to 8 (antisense compounds). 

16. A diagnostic agent containing at least one of the compounds accord- 
ing to claim 14 or 15. 

17. A medicament containing at least one of the compounds mentioned in 
claims 14 and/or 15 in therapeutically effective amounts. 

18. Use of the compounds according to claims 14 and/or 15 for preparing 
a medicament for the treatment of diseases Involving too high an ex- 
pression of the compounds according to at least one of claims 1 to 8, 
or too high an activity of the regions coding for the compounds ac- 
cording to claims 1 to 8. 

19. DNA, coding for the compounds mentioned in claims 1 to 8, and/or 
RNA involved in the transcription or translation of the compounds 
mentioned in claims 1 to 8. 



20. 



The DNA according to claim 19 having the SEQ. ID. NO. 5 or SEQ. ID. 
NO. 6. 
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Abstract 



A serine protease inhibitor, characterized by having a domain with four 
cysteines, and a sequence of 0 to 20 amino acids is present between the 
first and second cysteines, or the serine protease inhibitor has a domain 
with six cysteines, and a sequence of 7 to 20 amino acids is present 
between the first and second cysteines. 



1/6 

Fif ure 1 

VAKTI-1 cDNA and its translation into 
amino add sequence 
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Frame 2 M K I A 

ATG CAT GGA GTG GAC CTG TAG GCG ACT TGC ATC GTC TTC AAC ATG AAG ATA GCC 
10 19 28 37 46 55 

I MEMC-l-> 

1->HF6479 

TVSVLI^PIiAIiCLIQDAASlKN 
ACA GTG TCA GTG CTT CTG CCC TTG GOT CTT TGC CTC ATA CAA GAT GCT GCC AGT AAG AAT 
64 73 82 91 100 109 



-MEMC-1 ^> CHEF-1- 



ED QEMCHEFQAFMKNGKIiFC 
GAA GAT CAG GAA ATG TGC CAT GAA TTT CAG GCA TTT ATG AAA AAT GGA AAA CTG TTC TGT 
124 133 142 151 160 169 



-CHEF-14 

-<:HEF-11 > < CHEF-2- 



PQDKKFF QSLDGIMFINKCA 
CCC CAG GAT AAG AAA TTT TTT CAA.AGT CTT GAT GGA ATA ATG TTC ATC AAT AAA TGT GCC 
184 193 202 211 220 229 



< CHEF-2 HF6479 < { 

T C KM I li E KEAK S QIKRARH LA 
ACG TGC AAA ATG ATA CTG GAA AAA GAA GCA AAA TCA CAG AAG AGG GCC AGG CAT TTA GCA 
244 253 262 271 280 289 

RAPKATA PTELNCDDFK KGE 
AGA GCT CCC AAG GCT ACT GCC CCA ACA GAG CTG AAT TGT GAT GAT TTT AAA AAA GGA GAA 
304 - 313 322 331 340 349 

RDGDFICPDYYEAVCGTDGK 
AGA GAT GGG GAT TTT ATC TGT CCT GAT TAT TAT GAA GCT GTT TGT GGC ACA GAT GGG AAA 
3jS4 373 382 • 391 400 409 

TXDNRCAiCAENAKTGSQrG 
ACA TAT GAC AAC AGA TGT GCA CTG TGT GCT GAG AAT GCG AAA ACC GGG TCC CAA ATT GGT 
424 433 442 451 460 469 

^VKSEGECKSSH^E Q V R S I V S 
I 6TA AAA AGT GAA GGG GAA TGT AAG AGC AGT AAT CCA GAG CAG GTG AGG TCA ATT GTC AGC 
484 * ' 493 502 511 520 529 

IiMGNTGRIi TSKSK STOP 

CTG ATG GGA AAT ACT GGG AGG CTA ACT TCA AAT AGT AAG TAG GTG CTG TCC TCT TCC TTC 
544 553 562 571 580 589 



TTA GGT, GGG AGC CTT GGA AGG AAT TAA TTC TTG CTT TAT GTG AAA TGG AAT ACC CAG TTA 
604 613 622 631 640 649 

CTG CCC ACT AAT ATG AAA AAG CTA ATT ATA GTC TCT GAA ACT GGA TCA GAT TAG TTT GGT 
664 673 682 691 700 709 



GGT TAA GAT CTT TCA ATC TAT TGC TGC TTT GTA T 
724 733 742 749 
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Figure 2 2 / 

VAKTI-2 cDNA and its translation into 
amino acid sequence 



S 
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ATG 
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GGA 
10 


GTG 
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CTG 
19 


TAG 


GCG 


ACT 
28 


TGC 


ATC 


GTC 
37 


TTC 


AAC 


M 

ATG 
46 


K 

AAG 


I 

ATA 


A 

GCC 
55 


T 

ACA 


V 

GTG 


S 

TCA 
64 


V 

GTG 


L 

CTT 


L 

CTG 
73 


P 

CCC 


L 

TTG 


A 

GCT 
82 


L 

CTT 


C 

TGC 


L 

CTC 
91 


I 

ATA 


Q 

CAA 


D • 

GAT 

100 


A 

GCT 


A 

GCC 


S 

AGT 
109 


i— > 
IK 
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N 
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Repeat 1 
* 
























# 


E 

GAA 


D 

GAT 


Q 

CAG 
124 


E 

GAA 


M 

ATG 


TGC 
133 


u 

il 

CAT 


Hi 

GAA 


F 

TTT 
142 


Q 

CAG 


A 

GCA 


F 

TTT 
151 


M 

ATG 


K 

AAA 


AAT 
160 


G 

GGA 


K 

AAA 


L 

CTG 
169 


F 

TTC 
# 


C 

TGT 


P 

CCC 


Q 

CAG 


D 

GAT 
184 


K 

AAG 


K 

AAA 


F 

TTT 


F 

TTT 


Q 

CAA 


S 

AGT 
202 


h 

CTT 


D 

GAT 


G 

GGA 
211 


I 

ATA 


M 

ATG 


F 

TTC 
220 


I 

ATC 


N 

AAT 


K 

AAA 
229 


C 

TGT 


A 

GCC 


T 

ACG 


* 
C 

TGC 


K 

AAA 
244 


M 

ATG 


I 

ATA 


L 

CTG 
253 


E 

GAA 


K 

AAA 


E 

GAA 
262 


HF 6479 < — j 
A K S Q 1 
GCA AAA TCA CAG 
271 


K 

AAG 


R 

AGG 
280 


A 

GCC 


R 

AGG 


H 

CAT 
289 


L 

TTA 


A 

GCA 


























typical Kazal domain 








R 

:!! AGA 


A 

GCT 


P 

CCC 
304 


K 

AAG 


A 

GCT 


T 

ACT 
313 


A 

GCC 
# 


P 

CCA 


T 

ACA 
322 


E 

GAG 


L 

CTG 


N 

AAT 
331 


c 

TGT 


D 

GAT 


D 

GAT 
340 
+ 


F 

TTT 


K K 
AAA AAA 
349 


G E 
GGA GAA 


AGA 


D 

GAT 


G 

GGG 
364 


D 

GAT 


F 

TTT 


I 

ATC 
373 
# 


C 

TGT 


P 

CCT 


D 

GAT 

382 
* 


Y 

TAT 


Y 

TAT 


E 

GAA 
391 


A 

GCT 


V 

GTT 


C 

TGT 
400 


G 

GGC 


T 

ACA 


D 

GAT 
409 


G K 
GGG AAA 


ACA 


Y 

TAT 


U 

GAC 
424 


N 

AAC 


R. 
AGA 


C 

TGT 
433 


A 

GCA 


I. 

CTG 


C , 

TGT 

442 


A 

GCT 


E. N 
GAG AAT 
451 


A 

GCG 


K 

AAA 


T. 

ACC 
460 


G 

GGG 


S 

TCC 


Q 

CAA 
469 


I 

ATT 


G 

GGT 














+ 


















, Repeat 2 




:::V ksegeckssnpeqdvcsa f.^* 

=rjGTA AAA AGT GAA GGG GAA TGT AAG AGC AGT AAT CCA GAG CAG GAT GTA TGC AGT GCT TTT 
484 • 493 502 511 , 520 529 

# 


R 

CGG 


P 

CCC 


F 

TTT 
544 


V 

GTT 


R N 
AGA AAT 
553 


G R 
GGA AGA 


h 

■CTT 
562 


G C T R 
GGA TGC ACA AGG 
571 

# * 


E N 
GAA AAT 
. 580 


D 

GAT 


P 

CCT 


V 

GTT 
589 


L 

CTT 


G 

GGT 


P 

CCT 


D 

GAT 


G 

GGG 
604 


K • 
AAG 


T 

ACG 


H 

CAT 
€13 


G 

GGC 


N 

AAT 


K 

AAG 
622 


C 

TGT 


A H 
GCA ATG 
631 


C 

TGT 


A 

GCT 


£ 

GAG 
640 


L 

CTG 


F 

TTT 


L 

TTA 
€49 


K 

AAA 


E 

GAA 


A 

GCT 


E N 
GAA AAT 
664 


A 

GCC 


K 

AAG 


R £ 
CGA GAG 
€73 


G 

GGT 


E T R I 
GAA ACT AGA ATT 
682 691 


R 

CGA 


R N 
CGA AAT 
700 


A • 
GCT 


E 

GAA 


K 

AAG 
709 


D 

GAT 


F 

TTT 


Repeat 3 
























# 












C 

TGC 


K 

AAG 


E 

GAA 
724 


Y 

TAT 


E 

GAA 


K Q 
AAA CAA 
733 


V 

GTG 


R 

AGA 
742 


N 

AAT 


G 

GGA 


R 

AGG 
751 


L 

CTT 


F 

TTT 
# 


C 

TGT 
760 


T 

ACA 


R 

CGG 
* 


£ 

GAG 
769 


S 

AGT 


D 

GAT 


P 

CCA 


V 

GTC 


R 

CGT 
784 


G 

GGC 


P 

CCT 


D 

GAC 
793 


G 

GGC 


R 

AGG 


M 

ATG 
802 


H 

CAT 


G 

GGC 


N 

AAC 
811 


K 

AAA 


C 

TGT 


A 

GCC 
820 


h 

CTG 


C 

TGT 


A 

GCT 
829 


E 

GAA 


'f ' ' 
ATT 



FKRRFSEENSKTDQNLGKAE 
TTC AAG CGG CGT TTT TCA GAG GAA AAC AGT AAA ACA GAT CAA AAT- TTG GGA AAA GCT GAA 
844 853 862 871 880 889 
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Repeat 4 

EKTKVKREIVKL C S Q Y Q N Q A 
GAA AAA ACT AAA GTT AAA AGA GAA ATT GTG AAA CTC TGC AGT CAA TAT CAA AAT CAG GCA 
904 913 922 931 940 949 

# 

KNGILFCTRENDPIRGPDGK 
AAG AAT GGA ATA CTT TTC TGT ACC AGA GAA AAT GAC CCT ATT CGT GGT CCA GAT GGG AAA 

964 973 982 991 1000 1009 
# * 

MHGNLCSMCQVYFQAENEEK 
ATG CAT GGC AAC TTG TGT TCC ATG TGT CAA GTC TAC TTC CAA GCA GAA AAT GAA GAA AAG 
1024 1033 1042 1051 1060 1069 



( > HF 7665 

KKAEARARNKRIESGKATSYA 
AAA AAG GCT GAA GCA CGA GCT AGA AAC AAA AGA GAA TCT GGA AAA GCA ACC TCA TAT GCA 
1084 1093 1102 1111 1120 1129 



Repeat 5 



# 



E LCNEYRKLVRNGKLACTRE 
GAG CTT TGC AAT GAA TAT CGA- AAG CTT GTG AGG AAC GGA AAA CTT GCT TGC ACC AGA GAG 

1144 1153 1162 1171 1180 1189 

# * 

NDPIQGPDGKVHGNTCSMCE 
AAC GAT CCT ATT CAG GGC CCA GAT GGG AAA GTG CAC GGC AAC ACC TGC TCC ATG TGT GAG 

1204 1213 1222 1231 1240 1249 



HF 7665 <" 



-1 



11 V FFQAEEEEKKKKEGE SRN|K 

GTT TTT TTC CAA GCA GAA GAA GAA GAA AAG * AAA AAG AAG GAA GGC GAA TCA AGA AAC AAA 
^1 1264 1273 1282 * 1291 1300 1309 



Repeat 6 



, R 


Q S 
CAA TCT 
1324 


K 

AAG 


S T 
AGT ACA 
1333 


A 

GCT 
# 


S F 
TCC TTT 
1342 


E 

GAG 


E L C 
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y 
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1384 


R 

CGG 


L F 
CTT TTT 
1393 
# 


C 

TGC 


T R E 
ACC AGA GAG 
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* 


N D P I Q 
AAT GAC CCC ATC CA(3 
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GGC 


P D 
CCA GAT 
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G K 
GGG AAA 
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H G 
CAT GGC 
1444 
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1453 


S 
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1462 


E 

GAG 


A F F 
GCC TTC TTT 
1471 


Q Q 
CAA CAA 
1480 


E 

GAA 


E R 
GAA AGA 
1489 


A R 
GCA AGA 
















Repeat 7 








A K A 
GCA AAG 'GCT 
1504 


K R E 
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1513 
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TGT GCC 
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AGT 


V F K 
GTG TTC AAA 
1651 


I. E 
CTT GAA 
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GAA 


E E. . K K 
GAA GAG AAG AAA 
1669 


N 

AAT 


D K 
GAT AAA 
1684 


E 

GAA 


E K 
GAA AAA 
1633 


G 

GGG 


K V 
AAA GTT 
1702 


E 

GAG 


A E K 
GCT GAA AAA 
1711 


V K 
GTT AAG 
1720 


R 

AGA 


E A V- Q 
GAA GCA GTT CAG 
1729 ^ - 




Repeat 8 
* 














# 




E 

GAG 


L C 
CTG TGC 
1744 


S 

AGT 


E Y 
GAA TAT 
1753 


R 

CGT 


H Y 
CAT TAT 
1762 


V 

GTG 


R N G 
AGG AAT GGA 
1771 


R L 
CGA CTC 
1780 


P 

CCC 


C T 
TGT ACC 
1789 


R E 

AGA GAG 
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NDPIEGLDGKIHGKTCSMCE 
AAT GAT CCT ATT GAG GGT CTA GAT GGG AAA ATC CAC GGC AAC ACC TGC TCC ATG TGT GAA 
1804 1813 1822 1831 1840 1849 



AFFQQEAKEKERAEPRAKVK 
GCC TTC TTC GAG CAA GAA GCA AAA GAA AAA GAA AGA GOT GAA CCC AGA GCA AAA GTC AAA 
1864 1873 1882 1891 1900 1909 

Repeat 9 



REAEKETCDEFRRLLQNGKL 
AGA GAA GOT GAA AAG GAG ACA TGC GAT GAA TTT CGG AGA CTT TTG CAA AAT GGA AAA CTT 
1924 1933 1942 1951 1960 1969 



N 



K 



H 



K 



TTC TGC ACA AGA GAA AAT GAT CCT GTG CGT GGC CCA GAT GGC AAG ACC CAT GGC AAC AAG 



1984 



1993 



2002 



2011 



2020 



2029 



K 



K 



N 



K 



K 



TGT GCC ATG TGT AAG GCA GTC TTC CAG AAA GAA AAT GAG GAA AGA AAG AGG AAA GAA GAG 



2044 



2053 



2062 



2071 



2080 



2089 



EDQRNA AGHGSSGGGGGNTQ 
= ' GAA GAT CAG AGA AAT GCT GCA GGA CAT GGT TCC AGT GGT GGT GGA GGA GGA AAC ACT CAG 
n 2104 2113 2122 2131 2140 2149 

U Repeat 10 



DECAEYREQMKNGRLSCTRE 
GAC GAA TGT GCT GAG TAT CGG GAA CAA ATG AAA AAT GGA AGA CTC AGC TGT ACT CGG GAG 
2164 2173 2182 2191 . 2200 2209 



S DPVRDADGKSYNNQCTMCK 
AGT GAT CCT GTA CGT GAT GCT GAT GGC AAA TCG TAC AAC AAT CAG TGT ACC ATG TGT AAA 
2224 2233 2242 2251 2260 2269 



OAKLEREAERKNEYSRSRSNG 
n GCA AAA TTG GAA AGA GAA GCA GAG AGA AAA AAT GAG TAT TCT CGC TCC AGA TCA AAT GGG 
2284 2293 2302 2311 2320 2329 



Repeat 11 



TGSESG KDTCDEFRSQMKNG 
ACT GGA TCA GAA TCA GGG AAG GAT ACA TGT GAT GAG TTT AGA AGC CAA ATG AAA AAT GGA 
2344 2353 2362 2371 2380 2389 



KLICTRESDPVRGPDGKTHG 
AAA CTT ATC TGC ACT CGA GAA AGT GAC CCT GTC CGG GGT CCA GAT GGC AAG ACA CAT GGT 
2404 2413 2422 2431 2440 2449 



NKCTMCKEKLERE AAEKKRK 
AAT AAG TGT ACT ATG TGT AAG GAA AAA CTG GAA AGG GAA GCA GCT GAA AAA AAA AGA AAG 
2464 2473 2482 2491 2500 2509 



RMKTGAIQEKGAIQEKGAMT 
AGG ATG AAG ACA GGA GCA ATA CAG GAG AAA GGA GCA ATA CAG GAG AAA GGA GCA ATG ACA 
2524 2533 2542 2551 2560 2569 



KRICVVNFEACREMESLSAP 
AAG AGG ATC TGT GTC GTG AAT TTC GAA GCA TGC AGA GAA ATG GAA AGC TTA TCT GCA CCA 
2584 2593 2602 2611 2620 2629 
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EKITLFEAHMARCTSINVLC 
GAG AAA ATA ACC CTG TTC GAG GCC CAT ATG GCA AGA TGC ACA TCA ATA AAT GTG CTA TGT 
2644 2653 2662 2671 2680 2689 

VRASLIEKLMKEKRKMKRNQ 
GTC AGA GCA TCT TTG ATC GAG AAG CTA ATG 7\AA GAA AAA AGA AAG ATG AAG AGA AAT CAA 
2704 2713 2722 2731 2740 2749 

VASPQIMQRMSAVNF ETI STOP 
GTA GCA AGC CCT CAA ATA ATG CAA AGG ATG AGT GCA GTG AAT TTC GAA ACT ATA TAA GGA 
2764 2773 2782 2791 2800 2809 



ACA ATG AAC TCA TCT GCC CTA GAG AGA ATG ACC CAG TGC ACG GTG CTG ATG GAA AGT TCT 
2824 2833 2842 2851 2860 2869 



ATA CAA ACA AGT GCT ACA TGT GCA GAG CTG TCT TTC TAA CAG AAG CTT TGG AAA GGG CAA 
2884 2893 2902 2911 2920 2929 



AGC TTC AAG AAA TIAC CAT CCC ATG TTA GAG CTT CTC AAG AGG AAG ACA GCC CAG ACT CTT 
2944 2953 2962 2971 2980 2989 



TCA GTT CTC TGG ATT CTG AGA TGT GCA AAG ACT ACC GAG TAT TGC CCA GGA TAG GCT ATC 
3004 3013 3022 3031 3040 3049 



TTT GTC CAA AGG ATT TAA AGC CTG TCT GTG GTG ACG ATG GCC AAA CCT ACA ACA ATC CTT 
3064 3073 3082 3091 3100 3109 



GCA TGC TCT GTC ATG AAA ACC TGA TAG GCC AAA CAA ATA CAC ACA TCC GCA GTA CAG GGA 
3124 3133 3142 3151 3160 3169 



AGT GTG AGG AGA GCA GCA CCC CAG GAA CCA CCG CAG CCA GCA TGC CCC CGT TTG ACG AAT 
3184 • 3193 3202 ' 3211 3220 3229 



GAC AGG AAG ATT GTT GAA AGC CAT GAG GGA AAA AAT AAA CCC CAG TTT TGA ATC ACC TAG 
3244 3253 3262 3271 3280 3289 



CTT CAC CAT CTG TAT ATA CAA AGA ATT TTT CGG AGC TTG TTT TAT TTG CTA TAG AAA ACA 
3304 3313 3322 3331 3340 3349 



ATA CAG AGC TTT TGG GAA TGG AAT CAC TGA TTT *TCA GTC TTT TCC ATT TCT TTC CTC CTA 
3364 3373 3382 3391 3400 3409 



GAA TCT GTG ATC TGA GGG TAT AAA GAC ATT TCC ACC AAG TTT GAG CCC TCA AAA TGT CCT 
3424 3433 3442 3451 3460 3469 

polyadenylation signal 

GAT TAG AAT GCT GTC TGT CCA ACT GCC TGT T CA ATA AAA GTA AAC TCA GCA GAA AAA 

3484 3493 3502 3511 3520 3529 



poIy(A) tail 
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Figure 3 
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As a below named inventor, i declare that my residence, post office address and citizenship are stated below next to my name, the information given herein is true, th^t i believe that i am Ithe origmai, 
first and sole inventor (if only one name is Iistecrat 201 below), or an onginal, first and joint inventor (if plural inventors are named below at 201-203, or on additional sheets attached' heretoybf the subject 
matter which is claimed and for wfiich patent is sought on the invention enttled' 

Serine Proteinase Inhibitors 



/ 



which is described and claimed In; 
I I the attached specification 



[xl PCT International ApplicatiOD No 

I I tiie specificatton m application Senal No. 
Of applicable) and amended on 



PCT/EP 98/08424 



filed 23 December 1998 

filed 



I hereby state that 1 have reviewed and understand the contents of the above- identified specification, including the claims, as amended by any amendment referred to above 
1 acknowledge the duty to disclose information which is material to patentability as defined in Title 37. Code of Federal Regulations, §1 .56 

i hereby claim foreign pnority benefits under Title 35, United States Code, §119 (a)-(d) of any foreign application(s) for patent or inventor's certificate listed below and have also identified below any 
foreign application for patent or inventor's certificate having a filing date before that of the application on which prionty is claimed: 

Prior Foreign Application(s) 



197 


57 


572 


• 2 


Germany 


23 December 1997 


(Number) 

198 


00 


363 


.3 


(Country) 

Germany 


(Day/MonthA'ear Filed) 

08 January 1998 


(Number) 








(Country) 


^ {Day/Month/Year Filed) 


(Number) 








(Country) 


(Day/MonthA'ear Filed) 



Priority Claimed 


m 


□ 


Yes 


No 




□ 


Yes 


No 


□ 


□ 


Yes 


No 



I hereby claim the benefit under Title 35, United States Code,§119{e) of any United States provisional applicat!on(s) listed below: 
Application No. Filing Date Application No 



Filing Date _ 



I hereby claim the benefit under Title 35, United States Code, §120 of any United States applicaticn(s) listed below and, insofar as the subject matter of each of the claims of this application is not 
disclosed in the pnor United States application in the manner provided by the first paragraph of Title 35, United States Code, §112, 1 acknowledge the duty to disclose information which is material 
to patentability as defined m Title 37, Code of Federal Regulations, §1 56 which became available between the filing date of the pnor application and the national or PCT international filing date of 
this application 



(Application Senal No ) 



(Filing Date) 



(Status patented, pending, abandoned) 



pMnER OF ATTORNEY: As a named inventor, I hereby appoint the following attorneys (Registration No. ) to prosecute this application, receive and act on instructions from nny 
a^t, and transact ail business in the Patent and Trademark Office connected therewith. HARVEY B. JACOBSON, JR. (2Q,85D .: D. DOUGLAS PRICE Q4,514); J OHN CLARKE 
HClMAN42ZJ§9)JWARViN R. STERNjm64QilALLEN S. MELSE R (27,215): MICHAEL R. 5LOBASKY.( 2.6.421) : JONATHAN L. SCHERER (2aa5i£pVira. AiSENBERG 
(hli007); WILLIAM E. PLAYER (31,40911; YOON S. HAM (45.3Q7).arm'NAWNIEL A. HUMPHRIE S (22.772) 
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CUSTOMER NC 
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PROFESSIONa^^ 

400 SEVENTH STREirTNiwr" " 
'^WASHINGTON, D.C. 20663: 




DIRECT TELEPHONE CALLS TO: 

(please use Attorney's Docket No.) (202) 638-6666 

JACOBSON, PRICE, HOLMAN & STERN 

PROFESSIONAL LtMITED LIABILITY COMPANY 



*lriv'8ntor(s) name must include at least one unabbreviated first or middle name. 
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FULL NAME* 
;0F INVENTOR 


FAMILY NAME 

FQRSSMANN,. 


GIVEN NAME 

Wolf-Geora 


MIDDLE NAME 


; RESIDENCE & 
CITIZENSHIP 


CITY 
Hannov*=r 


STATE OR FOREIGN COUNTRY jN . 'N / 

Germany ijJjLsf^ 


COUNTRY OF CITIZENSHIP 

German 


POST OFFICE 
ADDRESS 


POST OFFICE ADDRESS 

Feodor-Lynen-Strasse 31 


CITY 

Hannover 


STATE OR COUNTRY 

Germany 


ZIP CODE 

D-30625 


o 
« 


FULL NAME* 
OF INVENTOR 


FAMJLY NAME 

MAGERT 


GIVEN NAME 


MIDDLE NAME 


RESIDENCE & 
, CITIZENSHIP 


^ffannover 


STATE OR FOREIGN COUNTRY A / 

Germany ^ ,7\00 


COUNTRY OF CITIZENSHIP 
German 


POST OFFICE 
ADDRESS 


POST OFFICE ADDRESS 

Moltkeplatz 8 


CITY ^ 

Hannover 


STATE OR COUNTRY 

Germany 


ZIP CODE 

D-30163 
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FULL NAME * 
OF INVENTOR 


FAMILY NAME 
STANDKER 


GIVEN NAME 
..Tmrig,ar. .. .. 


MIDDLE NAME 
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CITIZENSHIP 


CITY 

Hannover 


STATE OR FOREIGN COUNTRY 
Germany 


COUNTRY OF CITIZENSHTPN.r^ / 

German U/li>/^ 


POST OFFICE 
. ADDRESS 


POST OFFICE ADDRESS 
Dohmeyers Weg 25 


CITY 

Hannover 


STATE OR COUNTRY 

Genriany 


ZIP CODE' 
D-30625 



I further declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made are punishable by fine or imprisonment or both, under section 1001 of Title 18 of the 
United States Code; and that such willful false statements may jeopardize the validity of the application or any patent issuing thereon. 
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OF INVENTOR 
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GIVEN NAME 
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STATE OR FOREIGN COUNTRY 


COUNTRY OF CmZENSHIP 
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CITY 
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